北京大学数学学院

您所在的位置:网站首页 北京information science 北京大学数学学院

北京大学数学学院

2024-05-28 02:54| 来源: 网络整理| 查看: 265

机器学习与数据科学博士生系列论坛(第六十一期)—— Adaptive Algorithms in Linear Bandits

报告人:Jingxin Zhan (PKU)

时间:2023-11-09 16:00-17:00

地点:腾讯会议 551-1675-5419

摘要: Adaptive bandit algorithms that automatically exploit certain specific characteristics of environments have been studied for many years. At the higher level, the proposed algorithm adapts to a variety of types of environments, i.e., adversarial environments and stochastic environments. At the lower level, in each of the adversarial and stochastic regimes, the proposed algorithm adapts to certain environmental characteristics, for example, the total quadratic variation in loss sequence, thereby performing better.

In this talk, we will introduce adaptive bandit problems in the linear bandit setting, following the recent two works from COLT. They both apply the framework of follow-the-regularized-leader (FTRL) methods, which has already yielded many important results in the original bandit setting.

论坛简介:该线上论坛是由张志华教授机器学习实验室组织,每两周主办一次(除了公共假期)。论坛每次邀请一位博士生就某个前沿课题做较为系统深入的介绍,主题包括但不限于机器学习、高维统计学、运筹优化和理论计算机科学。



【本文地址】


今日新闻


推荐新闻


CopyRight 2018-2019 办公设备维修网 版权所有 豫ICP备15022753号-3